ABSTRACT

Rasch models for fundamental measurement in the psychological sciences are
derived from the principle of specific objectivity, the requirement that the
parameter value representing each component in a test situation be
independent of the other components. The dichotomous Rasch model for
two-faceted analysis, applicable to conventional paper-and-pencil tests, is
constructed. The many-faceted Rasch model is also derived by means of a
three-faceted example, comprising judges, examinees, test items, and a rating
scale, which is applicable to many judging situations. Any other particular
form of the many-faceted model may also be derived in a similar manner.
(Author/TJH)

Key-words: Rasch measurement, Objectivity, Rating scales



I. Introduction. 

Georg Rasch has some wise words to say on the subject of objectivity which
will guide us in this discussion:

"The concept of 'objectivity' raises fundamental problems in all sciences.
For a statement to be scientific, 'objectivity' is required. However,
exactly what 'objectivity' means is disputed among philosophers and I am not
going to enter into that debate" (Rasch 1964 p. 1).

"The comparison of any two subjects may be carried out in such a way that no
other parameters are involved than those of the two subjects - neither the
parameter of any other subject nor of any of the stimulus parameters.

"Similarly, any two stimuli may be compared independently of all other
parameters than just those of the two stimuli - the parameters of all other
stimuli as well as the parameters of the subjects having been replaced by
observable numbers.

"It is suggested that comparisons carried out under such circumstances will
be designated as 'specifically objective'. And the same term would seem
appropriate for statements about the model structure which are independent of
all of the parameters specified in the model, the unknown values of them
being, in fact, irrelevant for the structure of the model" (Rasch 1966 p. 21).

"A model is not meant to be true. Even in classical physics models are
temporary - good enough for some purposes" (Rasch 1964 p. 2).

Our aim, then, is to characterise each component of a complex test situation
by one parameter of fixed, but unknown, value which is objective, that is to
say independent of the other parameters. These parameters are to be
estimated, or measured, by means of a model which capitalises on this
invariance in the parameters.

II. Objectivity for a two-faceted test.

Rasch developed the idea of objectivity in his analysis of an intelligence
test taken by 1094 recruits to the Danish Army (Rasch 1980 p. 62). This was
a conventional two-faceted test situation: the observations were the
dichotomous (right-wrong, 1-0) results of interactions between the objects
(examinees) and the agents (test items).

The derivation of the corresponding Rasch model, from objectivity, is
included here as a demonstration of the principles involved, following
Wright (passim, in particular Wright and Linacre, 1987).

Consider a test in which two objects, On and Om, respond to numerous
replications of a dichotomously scored agent, Ai. The outcome of this test
is depicted in Figure 1.



                        Object On
                        0        1

    Object Om    0     F00      F10

                 1     F01      F11

Figure 1. Outcome of the hypothetical administration of numerous
replications of an agent to two objects. F10 is the count of the number of
times that object On succeeded on the agent at the same time that object Om
failed.



The only way to determine the difference between two objects is to observe
when they perform differently. In Figure 1, they have performed in the same
way with counts of F00 and F11; they have performed differently with counts
of F10 and F01. The comparison of On and Om is thus observed through the
comparison of F10 and F01, for which F00 and F11 provide a quantitative
context. In whatever way the comparison of F10 and F01 is made, it must be
independent of the length of the test. Thus, if the test were to be twice as
long, with identically proportionate results, then the comparison between On
and Om would become a comparison between 2*F10 and 2*F01. In order for this
comparison to be independent of the arbitrary number of replications, the
comparison between On and Om can be estimated by

$$\frac{\text{Performance of } O_n}{\text{Performance of } O_m} \approx \frac{F_{10}}{F_{01}} \tag{1}$$

This ratio of frequencies is the empirical but stochastic manifestation of
the ratio of the probabilities of the corresponding events. For objectivity,
the responses made by the objects must not be influenced by each other and so
must be independent. The probabilities corresponding to the frequencies in
Figure 1 are specified in Figure 2. It is the counts in all 4 cells of
Figure 1 which enable the probabilities in Figure 2 to be estimated. Thus
the performance of On and Om can be defined in terms of the probabilities
corresponding to F10 and F01, giving

$$\frac{\text{Performance of } O_n}{\text{Performance of } O_m} = \frac{P_{ni1} \, P_{mi0}}{P_{ni0} \, P_{mi1}} \tag{2}$$



                        Object On
                        0              1

    Object Om    0     Pni0*Pmi0      Pni1*Pmi0

                 1     Pni0*Pmi1      Pni1*Pmi1

Figure 2. The probabilities of the outcome of the administration of agent Ai
to objects On and Om. Pni1 is the probability of object On responding to
agent Ai successfully.
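The relation between the counts of Figure 1 and the probabilities of Figure 2 can be checked numerically. The following is a minimal sketch, assuming independent responses; the function names and the success probabilities 0.8 and 0.6 are invented for illustration and do not come from the text. It uses expected counts, so the comparison F10/F01 of equation (1) should not change with the number of replications and should agree with the probability ratio of equation (2).

```python
import math


def expected_counts(p_n1, p_m1, replications):
    """Expected Figure 1 counts for independent objects On and Om.

    p_n1, p_m1: assumed success probabilities of On and Om on agent Ai.
    The key "F10" means On succeeded (1) while Om failed (0).
    """
    p_n0, p_m0 = 1 - p_n1, 1 - p_m1
    return {
        "F00": replications * p_n0 * p_m0,
        "F10": replications * p_n1 * p_m0,
        "F01": replications * p_n0 * p_m1,
        "F11": replications * p_n1 * p_m1,
    }


def comparison(counts):
    """Equation (1): performance of On relative to Om, F10 / F01."""
    return counts["F10"] / counts["F01"]


def probability_ratio(p_n1, p_m1):
    """Equation (2): (Pni1 * Pmi0) / (Pni0 * Pmi1)."""
    return (p_n1 * (1 - p_m1)) / ((1 - p_n1) * p_m1)


# Hypothetical probabilities; the second test is twice as long as the first.
short_test = expected_counts(0.8, 0.6, replications=100)
long_test = expected_counts(0.8, 0.6, replications=200)
print(comparison(short_test), comparison(long_test), probability_ratio(0.8, 0.6))
```

All three printed values should agree up to floating-point rounding. With observed rather than expected counts the agreement would only be approximate, which is the stochastic aspect the text refers to.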

By specific objectivity, the comparison of On and Om must be independent of
which particular agent is used in making the comparison. Therefore the
comparison of their performance must be the same for agent Ai and for agent
Aj. Therefore

$$\frac{\text{Performance of } O_n}{\text{Performance of } O_m} = \frac{P_{ni1} \, P_{mi0}}{P_{ni0} \, P_{mi1}} = \frac{P_{nj1} \, P_{mj0}}{P_{nj0} \, P_{mj1}} \tag{3}$$

Rearranging the terms,

$$\frac{P_{ni1}}{P_{ni0}} = \frac{P_{nj1}}{P_{nj0}} \cdot \frac{P_{mi1}}{P_{mi0}} \cdot \frac{P_{mj0}}{P_{mj1}} \tag{4}$$



This is true for all m, n, i, j. Thus it is also true if object Om is object
O0, whose measure is defined to be at the local origin of the object
measurement scale, and also true for agent A0, whose calibration is defined
to be at the local origin of the agent measurement scale. Then

$$\frac{P_{ni1}}{P_{ni0}} = \frac{P_{n01}}{P_{n00}} \cdot \frac{P_{0i1}}{P_{0i0}} \cdot \frac{P_{000}}{P_{001}} \tag{5}$$



But P000/P001 is a constant term dependent on the relative placement of the
local origins of the object and agent sub-scales, so that, if they are
defined to coincide, then P000 = P001 = 0.5 and P000/P001 = 1.
Then, once the local origin is defined, Pn01/Pn00 is a ratio dependent only
on object On, so that log(Pn01/Pn00) can be expressed as Bn. Similarly,
log(P0i0/P0i1) is a ratio dependent only on agent Ai, which can be expressed
as Di. Equation (5) thus becomes, on taking logarithms and reparameterising
Pni1 to be Pni, so that Pni0 is 1 - Pni,

log(Pni/(1-Pni)) = Bn - Di                                            (6)

which is the Rasch model for dichotomous, two-faceted data.
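Specific objectivity under equation (6) can be illustrated with a short numerical check. This is a sketch only: the function names and the logit values are hypothetical, chosen for the example. It solves (6) for Pni and then evaluates the comparison of equation (2) for several agents of different difficulty; the result should depend only on Bn - Bm.

```python
import math


def rasch_probability(ability, difficulty):
    """Equation (6) solved for Pni: log(Pni / (1 - Pni)) = Bn - Di."""
    return 1.0 / (1.0 + math.exp(-(ability - difficulty)))


def comparison(ability_n, ability_m, difficulty):
    """Equation (2): (Pni1 * Pmi0) / (Pni0 * Pmi1) for one agent Ai."""
    p_n = rasch_probability(ability_n, difficulty)
    p_m = rasch_probability(ability_m, difficulty)
    return (p_n * (1 - p_m)) / ((1 - p_n) * p_m)


# Hypothetical measures in logits, chosen only for illustration.
b_n, b_m = 1.5, 0.5
ratios = [comparison(b_n, b_m, d) for d in (-2.0, 0.0, 1.0, 3.0)]
print(ratios)  # each value should be close to exp(b_n - b_m)
```

Whichever agent difficulty is used, the comparison of the two objects comes out the same, which is the invariance required in equation (3).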



III. Generalizing to many-faceted models using a three-faceted example.

The derivation of a particular form of the many-faceted model demonstrates
the general principles by which any other particular form of the many-faceted
model can also be derived. The particular model to be derived here is
applicable to a three-faceted test in which each judge of a panel of judges
awards a rating to each examinee on each item.

Consider the performance of two examinees, On and Om, as rated by a
particular judge, Jj, on replications of the same item, Ai. In whatever way
the ratings were originally recorded, they have been recoded into K+1
categories ordinally numbered from 0 to K, with each higher numbered category
representing a higher level of perceived performance, and with each category
having a non-zero probability of occurrence.



                            Examinee On
       Categories           k        l

    Examinee Om     k      Fkk      Flk

                    l      Fkl      Fll

Figure 3. Fkl represents the count of the number of times that examinee On
is awarded rating k and examinee Om is rated l by judge Jj across
replications of item Ai, where k > l.



The administration of the numerous replications of item Ai is the "test". The
performance levels of examinees On and Om can be compared by their relative
frequencies of being rated in the various categories of the rating scale.
Following the procedure in the discussion of objectivity, let us summarise
part of their performance by a 2x2 cross-tabulation of counts of ratings in
categories k and l of the rating scale, chosen so that category k is
numerically greater than category l and so represents a higher performance
level. This is depicted in Figure 3.

When both examinees are given the same rating (Fkk times for a rating of k,
and Fll times for a rating of l), their performance levels are
indistinguishable. When the examinees are rated differently (Fkl and Flk
times), the examinee with the greater relative frequency of ratings in
category k, the higher category, is perceived to have the higher ability. In
comparing performance levels, we intend that the numeric result be
independent of the number of replications. Thus, if the test were to be
repeated again, and were of the same length, we would expect to get
approximately the same numeric result. Moreover, if the two tests were then
to be concatenated, we would again expect to obtain about the same result.
The division of the two frequencies, Fkl and Flk, is compatible with this
expectation because we expect this ratio to be about the same when the test
is repeated, and also when the two tests are concatenated. Consequently, the
comparative levels of performance of examinees On and Om can be identified by
the stated ratio, Fkl/Flk.

$$\frac{\text{Performance level of } O_n}{\text{Performance level of } O_m} \approx \frac{F_{kl}}{F_{lk}} \tag{7}$$
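The cross-tabulation behind equation (7) can be sketched as follows. The ratings and the helper name `comparison_ratio` are hypothetical, made up for illustration. The example counts Fkl and Flk from paired ratings and checks that concatenating the test with an identical repeat leaves the ratio unchanged.

```python
from collections import Counter


def comparison_ratio(ratings_n, ratings_m, k, l):
    """Equation (7): Fkl / Flk from paired ratings of On and Om.

    ratings_n[r], ratings_m[r]: ratings given on replication r.
    Fkl counts replications with On rated k and Om rated l.
    Raises ZeroDivisionError if Flk is zero (no comparison is possible).
    """
    counts = Counter(zip(ratings_n, ratings_m))
    return counts[(k, l)] / counts[(l, k)]


# Hypothetical ratings over ten replications, for illustration only.
ratings_n = [2, 2, 1, 2, 1, 2, 2, 1, 2, 0]
ratings_m = [1, 2, 2, 1, 1, 1, 0, 1, 1, 2]

single = comparison_ratio(ratings_n, ratings_m, k=2, l=1)
# Concatenating the test with an identical repeat doubles both counts.
doubled = comparison_ratio(ratings_n * 2, ratings_m * 2, k=2, l=1)
print(single, doubled)
```

Here On is rated 2 while Om is rated 1 on four replications, and the reverse happens once, so both ratios are 4. Replications in which the ratings agree, or involve other categories, do not enter this particular comparison.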

In the limit, the ratio of the empirically observed frequencies, Fkl/Flk,
becomes the ratio of the probabilities, Pkl/Plk, where Pkl is the probability
of examinee On being given a rating of k and examinee Om a rating of l on one
replication of item Ai, and Plk is similarly defined. Thus we define the
ratio Pkl/Plk to be the ratio of the examinees' performances.

$$\frac{\text{Performance level of } O_n}{\text{Performance level of } O_m} = \frac{P_{kl}}{P_{lk}} \tag{8}$$

But, for objectivity, the ratings given examinees Om and On must be
independently awarded by the judge. Consequently,

$$P_{kl} = P_{nijk} \cdot P_{mijl} \tag{9}$$

and

$$P_{lk} = P_{nijl} \cdot P_{mijk} \tag{10}$$

where Pnijk is the probability of examinee On being given a rating of k on
item Ai by judge Jj, and Pnijl, Pmijk, Pmijl are similarly defined. Then

$$\frac{\text{Performance level of } O_n}{\text{Performance level of } O_m} = \frac{P_{kl}}{P_{lk}} = \frac{P_{nijk}}{P_{nijl}} \cdot \frac{P_{mijl}}{P_{mijk}} \tag{11}$$

However, also for objectivity, the relative performance of examinees On and
Om must be independent of which particular item is used to compare them.
Thus, though performance levels are initially defined in terms of item Ai,
the relative performance levels must have the same value when defined in
terms of any conceptually equivalent item Ai'. That is

$$\frac{\text{Performance level of } O_n}{\text{Performance level of } O_m} = \frac{P_{nijk}}{P_{nijl}} \cdot \frac{P_{mijl}}{P_{mijk}} = \frac{P_{ni'jk}}{P_{ni'jl}} \cdot \frac{P_{mi'jl}}{P_{mi'jk}} \tag{12}$$

then

$$\frac{P_{nijk}}{P_{nijl}} = \frac{P_{mijk}}{P_{mijl}} \cdot \frac{P_{ni'jk}}{P_{ni'jl}} \cdot \frac{P_{mi'jl}}{P_{mi'jk}} \tag{13}$$

For objectivity, this ratio of the probabilities of examinee On being rated
in categories k and l must be independent of whichever examinee Om is used in
the comparison. So, let us consider examinee O0 with performance level at
the local origin of the ability scale. Similarly the ratio must also be
independent of whichever item Ai' is used for the comparison. Thus it must
also hold for item A0 chosen to have difficulty at the local origin of the
item scale.

$$\frac{P_{nijk}}{P_{nijl}} = \frac{P_{0ijk}}{P_{0ijl}} \cdot \frac{P_{n0jk}}{P_{n0jl}} \cdot \frac{P_{00jl}}{P_{00jk}} \tag{14}$$

If, instead of comparing performance levels by means of items Ai and Ai', we
compare performance levels by means of the ratings given by judges Cj and Cj'
(the judge written Jj above is denoted Cj from here on) over numerous
replications of item Ai, then again we expect the relative performance levels
of the examinees to be maintained.

$$\frac{\text{Performance level of } O_n}{\text{Performance level of } O_m} = \frac{P_{nijk}}{P_{nijl}} \cdot \frac{P_{mijl}}{P_{mijk}} = \frac{P_{nij'k}}{P_{nij'l}} \cdot \frac{P_{mij'l}}{P_{mij'k}} \tag{15}$$

so that

$$\frac{P_{nijk}}{P_{nijl}} = \frac{P_{mijk}}{P_{mijl}} \cdot \frac{P_{nij'k}}{P_{nij'l}} \cdot \frac{P_{mij'l}}{P_{mij'k}} \tag{16}$$

Again this must be true if judge Cj' is chosen to be judge C0 with severity
at the local origin of the severity scale, and examinee Om is examinee O0,
and when item Ai is replaced by item A0. Therefore

$$\frac{P_{n0jk}}{P_{n0jl}} = \frac{P_{00jk}}{P_{00jl}} \cdot \frac{P_{n00k}}{P_{n00l}} \cdot \frac{P_{000l}}{P_{000k}} \tag{17}$$

Furthermore, for objectivity, the relative severity levels of judges Cj and
Cj' must be maintained whether the judging takes place over numerous
replications of the administration of either item Ai or item Ai' to the same
examinee On.

$$\frac{\text{Severity level of } C_{j'}}{\text{Severity level of } C_j} = \frac{P_{nijk}}{P_{nijl}} \cdot \frac{P_{ni'jl}}{P_{ni'jk}} = \frac{P_{nij'k}}{P_{nij'l}} \cdot \frac{P_{ni'j'l}}{P_{ni'j'k}} \tag{18}$$

then

$$\frac{P_{nijk}}{P_{nijl}} = \frac{P_{ni'jk}}{P_{ni'jl}} \cdot \frac{P_{nij'k}}{P_{nij'l}} \cdot \frac{P_{ni'j'l}}{P_{ni'j'k}} \tag{19}$$

Again this must be true if judge Cj' is judge C0 chosen at the origin of the
severity scale, and examinee On is examinee O0, and item Ai' is item A0.

$$\frac{P_{0ijk}}{P_{0ijl}} = \frac{P_{00jk}}{P_{00jl}} \cdot \frac{P_{0i0k}}{P_{0i0l}} \cdot \frac{P_{000l}}{P_{000k}} \tag{20}$$

Substituting (20) and (17) in (14), and simplifying

$$\frac{P_{nijk}}{P_{nijl}} = \frac{P_{n00k}}{P_{n00l}} \cdot \frac{P_{0i0k}}{P_{0i0l}} \cdot \frac{P_{00jk}}{P_{00jl}} \cdot \left( \frac{P_{000l}}{P_{000k}} \right)^{2} \tag{21}$$

which gives a general form in which each term is an expression of the
relationship between a component of a facet and the local origin of a
subscale, in the context of a particular pair of categories.
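For readers who wish to follow the simplification, the substitution can be written out in full. This is only a restatement of the step described above in the same symbols, with braces marking which factors come from (20) and which from (17); the final factor is the last term of (14).

```latex
\frac{P_{nijk}}{P_{nijl}}
= \underbrace{\frac{P_{00jk}}{P_{00jl}} \cdot \frac{P_{0i0k}}{P_{0i0l}} \cdot \frac{P_{000l}}{P_{000k}}}_{\text{(20)}}
\cdot
\underbrace{\frac{P_{00jk}}{P_{00jl}} \cdot \frac{P_{n00k}}{P_{n00l}} \cdot \frac{P_{000l}}{P_{000k}}}_{\text{(17)}}
\cdot
\frac{P_{00jl}}{P_{00jk}}
```

One of the two factors P00jk/P00jl cancels against P00jl/P00jk, and the two factors P000l/P000k combine into the squared term, which leaves equation (21).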



IV. The three-faceted dichotomous model.

Considering (21) as a dichotomous model where k=1 ("right") and l=0
("wrong"), then this equation expresses the ratio of the probabilities of the
possible outcomes as a product of terms which relate each component of each
facet with the local origin of its subscale. These terms are independent of
whichever other examinees, items and judges are included in the test
situation. To consider these terms in an additive way, we can take
logarithms and assign a numerical direction to each term in accordance with
conventional interpretation. Let

Bn = log(Pn001/Pn000), which is defined to be the ability of an examinee
Di = log(P0i00/P0i01), which is defined to be the difficulty of an item
Cj = log(P00j0/P00j1), which is defined to be the severity of a judge

We also define the relationship of the subscales, such that the
probability of "original" examinee O0 being rated a "1" by judge C0 on item
A0 is 0.5, so that the last term of equation (21) becomes 1 and contributes
0 once logarithms are taken.

Reparameterizing Pnij = Pnij1, so that 1 - Pnij = Pnij0, gives equation
(22), the three-faceted Rasch model for the dichotomous case.

log(Pnij/(1-Pnij)) = Bn - Di - Cj                                    (22)
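Equation (22) can be solved for Pnij to give the probability of a "right" rating. The sketch below assumes nothing beyond (22) itself; the function name and the logit values are hypothetical and serve only to show how a more severe judge lowers the probability of success for the same examinee and item.

```python
import math


def success_probability(ability, difficulty, severity):
    """Equation (22) solved for Pnij: log(Pnij / (1 - Pnij)) = Bn - Di - Cj."""
    logit = ability - difficulty - severity
    return 1.0 / (1.0 + math.exp(-logit))


# Hypothetical measures in logits, for illustration only.
p_lenient = success_probability(ability=1.0, difficulty=0.5, severity=-0.5)
p_severe = success_probability(ability=1.0, difficulty=0.5, severity=1.0)
# Examinee, item and judge all at their local origins.
p_origin = success_probability(ability=0.0, difficulty=0.0, severity=0.0)
print(p_lenient, p_severe, p_origin)
```

The last value is 0.5, matching the definition that examinee O0 has an even chance of being rated "1" by judge C0 on item A0.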



V. The three-faceted rating scale model.

When we consider examinees On and Om in the more general circumstances of a
rating scale, we do not wish the comparison of their abilities to depend on
which particular pair of categories of the rating scale are used for the
comparison. So we return to equation (11) which stated:

$$\frac{\text{Performance level of } O_n}{\text{Performance level of } O_m} = \frac{P_{kl}}{P_{lk}} = \frac{P_{nijk}}{P_{nijl}} \cdot \frac{P_{mijl}}{P_{mijk}}$$

We wish to generalize this equation to any pair of categories, but the rating
scale categories are not independent but structured. In order to determine
the structure in an objective manner, we require that performance levels are
invariant when they are compared using any pair of adjacent categories in
ascending order. This is the only possible objective structuring, since
invariance which is not over adjacent categories, but over some pairing of
non-adjacent categories, results in a contradiction or indeterminacy in the
rating scale structure. Thus, if a rating scale has 3 categories and
performance levels are to be invariant only when the top and bottom
categories are used for the comparison in (11), then performance levels based
on the middle category are indeterminate, and are not objective.

Invariance in relative performance when categories are chosen such that k is
one greater than l, and also k' is chosen one greater than l', yields

$$\frac{\text{Performance level of } O_n}{\text{Performance level of } O_m} = \frac{P_{nijk}}{P_{nijl}} \cdot \frac{P_{mijl}}{P_{mijk}} = \frac{P_{nijk'}}{P_{nijl'}} \cdot \frac{P_{mijl'}}{P_{mijk'}} \tag{23}$$

so that

$$\frac{P_{nijk}}{P_{nijl}} = \frac{P_{nijk'}}{P_{nijl'}} \cdot \frac{P_{mijk}}{P_{mijl}} \cdot \frac{P_{mijl'}}{P_{mijk'}} \tag{24}$$

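Since we want this result to be generalizable, we must be able to substitute
examinee O0, item A0, and judge C0,

$$\frac{P_{n00k}}{P_{n00l}} = \frac{P_{n00k'}}{P_{n00l'}} \cdot \frac{P_{000k}}{P_{000l}} \cdot \frac{P_{000l'}}{P_{000k'}} \tag{25}$$

Reordering the terms,

$$\frac{P_{n00k}}{P_{n00l}} = \left( \frac{P_{n00k'}}{P_{n00l'}} \cdot \frac{P_{000l'}}{P_{000k'}} \right) \cdot \frac{P_{000k}}{P_{000l}} \tag{26}$$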

The two terms in parentheses are invariant over changes in choice of pairs of
categories and so are independent of the local structure of the rating scale,
but they are not independent of the choice of object, so we can accordingly
rewrite them as Pn00, so that

$$\frac{P_{n00k}}{P_{n00l}} = P_{n00} \cdot \frac{P_{000k}}{P_{000l}} \tag{27}$$

Similar equations hold for P0i0k/P0i0l and P00jk/P00jl, so that, substituting
into (21),

$$\frac{P_{nijk}}{P_{nijl}} = P_{n00} \cdot P_{0i0} \cdot P_{00j} \cdot \frac{P_{000k}}{P_{000l}} \tag{28}$$

We have an equation in which the ratio of the probabilities of particular
outcomes is the product of terms which depend only on a single component and
the local origin of its subscale, combined with a term dependent on the pair
of categories used for the comparison. Let

Bn = log(Pn00), which is defined to be the examinee ability,
Di = - log(P0i0), which is defined to be the item difficulty,
Cj = - log(P00j), which is defined to be the judge severity,
Fk = - log(P000k/P000l), which is defined to be the difficulty of the
     step from category k-1 (=l) to category k of the rating scale.

Then equation (28), the three-faceted Rasch rating scale model, becomes

log(Pnijk/Pnij(k-1)) = Bn - Di - Cj - Fk                             (29)
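Equation (29) fixes only the log-odds of adjacent categories. To obtain the probability of each category one further assumption is needed, namely that the probabilities of the K+1 categories sum to one; the closed form used below follows from that assumption and is not stated in the text. The function name and all numerical values are hypothetical.

```python
import math


def category_probabilities(ability, difficulty, severity, steps):
    """Probabilities of categories 0..K implied by equation (29).

    steps: [F1, ..., FK], the step difficulties; category 0 has no step.
    Assumes the K+1 category probabilities sum to one.
    """
    logit = ability - difficulty - severity
    # Cumulative sums of (Bn - Di - Cj - Fk) give each category's log-numerator.
    log_numerators = [0.0]
    for step in steps:
        log_numerators.append(log_numerators[-1] + logit - step)
    total = sum(math.exp(v) for v in log_numerators)
    return [math.exp(v) / total for v in log_numerators]


steps = [-1.0, 0.0, 1.0]  # hypothetical F1..F3 with mean zero
probs = category_probabilities(ability=0.8, difficulty=0.2, severity=0.1, steps=steps)
for k in range(1, len(probs)):
    # Each adjacent-category log-odds should reproduce Bn - Di - Cj - Fk.
    print(k, math.log(probs[k] / probs[k - 1]))
```

With these values Bn - Di - Cj is 0.5, so the printed log-odds should be close to 1.5, 0.5 and -0.5 for k = 1, 2, 3.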

In (29), the parameters relating to the particular examinees, items, and
judges interacting to make each rating have been separated, and so (29) is
objective in that the parameters of the particular examinees, judges and
items enter independently into the rating process. Nevertheless, the parameter
estimate of Bn, say, could be inflated by an arbitrary amount so long as the
other parameter estimates were adjusted to compensate. Thus the actual
placement of the examinee, item, judge and step subscales within the common
frame of reference is arbitrary. By convention, local origins for each
subscale are chosen such that the mean calibrations of the items, of the
judges, and of the rating scale steps are each zero. The local origin of
the examinee abilities' subscale is then defined uniquely by the model.
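The arbitrariness of the origins, and the centering convention that removes it, can be sketched as follows. The helper names and the uncentered estimates are hypothetical. The item, judge and step subscales are each shifted to mean zero, and the examinee subscale absorbs the shifts so that the right-hand side of (29) is unchanged.

```python
def log_odds(ability, difficulty, severity, step):
    """Right-hand side of equation (29): Bn - Di - Cj - Fk."""
    return ability - difficulty - severity - step


def center(values):
    """Shift a subscale so its mean calibration is zero; also return the shift."""
    mean = sum(values) / len(values)
    return [v - mean for v in values], mean


# Hypothetical uncentered estimates, for illustration only.
abilities = [2.0, 3.5]
difficulties = [1.0, 2.0, 3.0]
severities = [0.5, 1.5]
steps = [-0.5, 1.5]

difficulties_c, d_shift = center(difficulties)
severities_c, c_shift = center(severities)
steps_c, f_shift = center(steps)
# The examinee subscale absorbs the three shifts so every log-odds is unchanged.
abilities_c = [b - d_shift - c_shift - f_shift for b in abilities]

before = log_odds(abilities[0], difficulties[1], severities[0], steps[1])
after = log_odds(abilities_c[0], difficulties_c[1], severities_c[0], steps_c[1])
print(before, after)
```

Both values are the same, so the two sets of estimates describe identical rating probabilities; only the centered set follows the stated convention.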

VI. Conclusion.

The manner in which the dichotomous two-faceted Rasch model can be derived
from objectivity has been demonstrated to extend to an example of the
many-faceted model which includes a rating scale. Any other particular form
of the many-faceted model may also be derived in a similar way.